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Serratia plymuthica AS13 is a plant-associated Gammaproteobacteria, isolated from rapeseed 
roots. It is of special interest because of its ability to inhibit fungal pathogens of rapeseed and 
to promote plant growth. The complete genome of 5. plymuthica AS13 consists of a 
5,442,549 bp circular chromosome. The chromosome contains 4,951 protein-coding genes, 
87 tRNA genes and 7 rRNA operons. This genome was sequenced as part of the project enti- 
tled "Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on 
plant pathogens" within the 201 0 DOE-JGI Community Sequencing Program (CSP201 0). 



Introduction 



The members of the genus Serratia are widely dis- 
tributed in nature. They are commonly found in soil, 
water, plants, insects, and other animals including 
humans [1]. The genus includes biologically and 
ecologically diverse species - from those beneficial 
to economically important plants, to pathogenic 
species that are harmful to humans. The plant- 
associated species comprise both endophytes and 
free living taxa, such as S. proteamaculans, S. 
plymuthica, S. liquefaciens and S. grimesii. Most of 
them are of interest because of their ability to pro- 
mote plant growth and inhibit plant pathogenic fun- 
gl [2-6]. 



to its ability to stimulate rapeseed plant growth and 
to inhibit soil borne fungal pathogens such as Verti- 
cillium dahlia and Rhizoctonia solani [6]. Here we 
present a description of the complete genome of S. 
plymuthica AS13 and its annotation. 



A representative sequence of the 16S rRNA gene of S. 
plymuthica AS13 was compared with the most re- 
cently released GenBank databases using NCBI 
BLAST [7] under default settings. It showed that the 
strain AS13 shares 99-100% similarity with the ge- 
nus Serratia. When considering high-scoring seg- 
ment pairs [HSPs) from the best 250 hits, the most 
frequent matches were several unspecified Serratia 
strains [17.2%] with maximum identity of 97-100%, 
while S. plymuthica [5.2%] had maximum identity of 
97-100%, S. proteamaculans [4.8%] maximum iden- 
tity of 97-99%, S. marcescens [4.8%] maximum iden- 
tity of 96-97% and also different Rahnella strains 
[7%] maximum identity of 97-98%. 



Classification and features 



There are currently 16 validly named Serratia spe- 
cies. However, there are several unidentified plant- 
associated Serratia strains that have an impact on 
agriculture by stimulating plant growth and/or in- 
hibiting soil borne plant pathogens [3]. S. plymuthica 
AS 13 was isolated from rapeseed roots from Uppsa- 
la, Sweden. Our interest in 5. plymuthica AS13 is due 
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The phylogenetic relationship of S. plymuthica AS13 
is shown in Figure 1 in a 16S rRNA based tree. All 
Serratia lineages clustered together and were dis- 
tinct from other enterobacteria [except 
Obesumbacterium proteus). The tree also shows its 
very close relation with S. plymuthica strains AS9 
and AS12, which was confirmed by digital DNA-DNA 
hybridization values [12] above 70% when com- 
pared with the [unpublished] draft genome se- 
quence of the 5. plymuthica type strain Breed K-yr 
from a culture of DSM 4540, and when compared 
with the complete genome sequences of S. 
plymuthica AS9 [13] and S. plymuthica AS12 [14] 
using the GGDC web server [15]. 

Strain AS13 is a rod shaped bacterium, 1-2 |im long, 
0.5-0.7 |im wide [Figure 2 and Table 1], is Gram- 



negative, motile, and a member of the family 
Enterobacteriaceae. The bacterium is a facultative 
anaerobe and grows within the temperature range 4 
°C - 40 °C and within a pH range of 4 - 10. It has 
chitinolytic, cellulolytic, proteolytic, and 
phospholytic activity [6] and can easily grow on dif- 
ferent carbon sources such as glucose, cellobiose, 
succinate, mannitol, arabinose and inositol. It forms 
red to pink colored colonies that are 1-2 mm in di- 
ameter on potato dextrose agar at low temperature. 
The color of the bacterium depends on the growth 
substrate, temperature and pH of the culture medi- 
um [30]. The bacterium is deposited in the Culture 
Collection, University of Goteborg, Sweden [CCUG] 
as S. plymuthica AS13 [= CCUG 61398). 
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— Dickeya chrysanthemi (AJ23341 2) 

— Citrobacter freundii (AJ233408) 
Buttiauzlla agrestis (AJ233400) 

— Tatumella ptyseos (AJ233437) 
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Erwinia amyiovora (AJ23341 0) 

— Proteus vulgaris (AJ233425) 
Xanthomonas cucurbitae (Y1 0760) 
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Figure 1. Phylogenetic tree highlighting the position of S. plymuthica AS13 in relation to other 
genera within the family Enterobacteriaceae, based on 1,472 characters of the 16S rRNA gene 
sequence aligned in ClustalW2 [8]. The tree was constructed under the maximum likelihood cri- 
terion using MEGA5 software [9] and rooted with Xanthomonas cucurbitae (a member of the 
Xanthomonadaceae family). The branches are scaled based on the expected number of substitu- 
tions per site. The numbers above branches are support values from 1,000 bootstrap replicates if 
larger than 60% [10]. The lineages shown in blue color are the genome sequences of bacterial 
strains that are registered in GOLD [11]. 
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Figure 2. Scanning electron micrograph of 5. plymuthica AS13 



Chemotaxonomy 

Little is known about the chemotaxonomy of 5. 
plymuthica AS13. Fatty acid methyl ester [FAME) 
analysis showed the main fatty acid in strain AS13 
comprises Ci&o (25.27%], Ci&iaiyc (15.41%], Ci&icoyc 
(18.17%], Ci4.o (5.21%], Ci7.o cyclo (18.53%], along 
with other minor fatty acid components. Previously 
it has been shown that Serratia spp. contain a mix- 
ture of Ci4:o, Ci6:o, Ci6:i and Ci8:i+2 fatty acids in which 
50-80% of the total fatty acid in the cell is Ci4:o and 
other fatty acids are less than 3% each [31]. This is 
consistent with the fact that Ci4:o fatty acid is charac- 
teristic of the family Enterobacteriaceae. 

Genome sequencing information 

S. plymuthica AS13, a bacterial strain isolated from 
rapeseed roots was selected for sequencing on the 
basis of its biocontrol activity against fungal patho- 
gens of rapeseed and its plant growth promoting 
ability. The genome project is deposited in the Ge- 
nomes On Line Database [11] (GOLD ID = Gc01776] 
and the complete genome sequence is deposited in 
GenBank (INSDC ID = CP002775]. Sequencing, fin- 
ishing and annotation were performed by the DOE 
Joint Genome Institute (JGI]. A summary of the pro- 
ject information is shown in Table 2 and its associa- 
tion with MIGS identifiers. 

Growth conditions and DNA isolation 

S. plymuthica AS13 was grown in Luria Broth (LB] 
medium at 28 °C until early stationary phase. The 
DNA was extracted from the cells by using a 
standard CTAB protocol for bacterial genomic 
DNA isolation that is available at JGI [32]. 



Genome sequencing and assembly 

The genome of S. plymuthica AS13 was sequenced 
using a combination of lUumina and 454 sequencing 
platforms. The details of library construction and 
sequencing can be found at the JGI [32]. The se- 
quence data from Illumina GAii (1,457.3 Mb] were 
assembled with Velvet [33] and the consensus se- 
quence was computationally shredded into 1.5 kb 
overlapping fake reads. The sequencing data from 
454 pyrosequencing (79.5 Mb] were assembled with 
Newbler and consensus sequences were computa- 
tionally shredded into 2 kb overlapping fake reads. 
The initial draft assembly contained 86 contigs in 1 
scaffold. The 454 Newbler consensus reads, the 
Illumina Velvet consensus reads and the read pairs 
in the 454 paired end library were assembled and 
quality assessment performed in the subsequent 
finishing process by using software phrap package 
[34-37]. Possible mis-assemblies were corrected 
with gapResolution [32], Dupfinisher [38], or by 
sequencing cloned bridging PGR fragments with 
subcloning. The gaps between contigs were closed 
by editing in the software Consed [37], by PGR and 
by Bubble PGR primer walks (J.-F. Ghang, un- 
published]. Fifty one additional reactions were nec- 
essary to close gaps and to raise the quality of the 
finished sequence. The sequence reads from 
Illumina were used to correct potential base errors 
and increase consensus quality using the software 
Polisher developed at JGI [39]. The final assembly is 
based on 46.8 Mb of 454 draft data which provides 
an average 8.7 x coverage of the genome and 1,415.6 
Mb of Illumina draft data which provides an average 
262.2 X coverage of the genome. 
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Genome annotation 

The S. plymuthica AS13 genes were identified using 
Prodigal [40] as part of the genome annotation pipe- 
line at Oak Ridge National Laboratory (ORNL], Oak 
Ridge, TN, USA, followed by a round of manual 
curation using the JGI GenePRMP pipeline [41]. The 
predicted CDS were translated and used to search the 
National Center for Biotechnology Information 
[NCBI] nonredundant database, Uniport, TIGR-Fam, 
Pfam, PRIAM, KEGG, COG and InterPro databases. 
Non-coding genes and miscellaneous features were 
predicted using tRNAscan-SE [42], RNAmmer [43], 
Rfam [44], TMHMM [45], and signalP [46]. Additional 
gene prediction analysis and functional annotation 
was performed within the Integrated Microbial Ge- 



nomes - Expert Review (IMG-ER] platform developed 
by the Joint Genome Institute, Walnut Creek, CA, USA 
[47]. 

Genome properties 

The genome of 5. plymuthica AS13 has a single circu- 
lar chromosome of 5,442,549 bp with 55.96% GC 
content [Table 3 and Figure 3]. It has 5,139 predict- 
ed genes, of which 4,951 were assigned as protein- 
coding genes. Among them, most of the protein cod- 
ing genes (84.41%) were functionally assigned while 
the remaining ones were annotated as hypothetical 
proteins. 112 genes were assigned as RNA genes and 
76 as pseudogenes. The distribution of genes into 
COG functional categories is presented in Table 4. 



Table 1 . Classification and general features of 5. plymuthica AS1 3 according to the MIGS recommendations [1 6] 



MIGS ID 


Property 


Term 


Evidence code 






Domain Bacteria 


IAS [1 7] 






Phylum Proteobacteria 


IAS [1 8] 






Class Gammaproteobacteria 


IAS [19,20] 




Current classification 


Order " Enterobacteriales" 


IAS [21] 






Family Enterobacteriaceae 


IAS [22-24] 






Genus Serratia 


IAS [22,25,26] 






Species Serratia plymuthica 


IAS [22,27] 






Strain AS! 3 


IDA 




Gram stain 


Negative 


IDA 




Cell shape 


Rod-shaped 


IDA 




Motility 


Motile 


IDA 




Sporulation 


Non-sporulating 


IDA 




Temperature range 


Mesophilic 


IDA 




Optimum temperature 


28°C 


IDA 




Carbon source 


Glucose, inositol, arabinose, succinate, sucrose, fructose 


IDA 




Energy metabolism 


Chemoorganotrophic 


IDA 


MlGS-6 


Habitat 


Rapeseed roots 


IDA 


MlGS-6.3 


Salinity 


Medium 


IDA 


MIGS-22 


Oxygen 


Facultative 


IDA 


MIGS-15 


Biotic relationship 


Plant associated 


IAS [6] 


MIGS-14 


Pathogenicity 


None 


IDA 




Biosafety level 


1 


IAS [28] 


MIGS-4 


Geographic location 


LJppsala, Sweden 


NAS 


MIGS-5 


Sample collection time 


Summer 1 998 


IDA 


MIGS-4. 1 


Latitude 


59.8 


NAS 


MIGS-4.2 


Longitude 


17.65 


NAS 


MIGS-4. 3 


Depth 


0.1 m 


NAS 


MIGS-4.4 


Altitude 


24-25 m 


NAS 



a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the litera- 
ture); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally 
accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [29] . If the 
evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a 
live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements. 
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Table 2. Genome sequencing project information 



MIGS ID 


Property 


Term 




MIGS-31 


Finishing quality 


Finished 




MIGS-28 


Libraries used 


Three libraries: one 454 standard library, one paired en 
kb insert size) and one lllumina library) 


d 454 library (9.0 


MIGS-29 


Sequencing platforms 


lllumina GAii, 454 GS FLX Titanium 




MIGS-31. 2 


Fold coverage 


262.2 X lllumina, 8.7 x pyrosequencing 




MIGS-30 


Assemblers 


Newbler version 2.3, Velvet 1 .0.1 3, phrap version SPS 


- 4.24 


MIGS-32 


Gene calling method 
NCBI project ID 
INSDC ID 

Genbank Date of Release 
GOLD ID 
Project relevance 


Prodigal 1.4, GenePRIMP 

60455 

CP002775 

October 12, 2011 

Gc01776 

Biocontrol, Agriculture 





5300001 S'^OOOOi 100001 



5000001 
4900001 





3600001 

3500001 



= 1400001 



1900001 
2000001 



2800001 2700001 2600001 



Figure 3. Graphical circular map of the chromosome. From outside to the center: Genes on forw/ard 
strand (color by COG categories). Genes on reverse strand (color by COG categories), RNA genes 
(tRNAs blue, rRNAs red, other RNAs black), GC content, GC skew. 
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Table 3. Genome statistics 



A ttriKi tff^ 


V dlilC 
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vjtrllUIIItr dIZc \,UU^ 




1 UU.UU /o 
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L^IN/A 0 + *^-, LUlILclIL \,Up^ 


Q OA^ Aj^n 


DD.yO /o 
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1 DO DDo/, 

1 UU.UU /O 


ixl n/a fdtrl IcD 


1 1 Z 


Z . 1 O /o 


iixlN/A UUtrfUllD 


7 


n 1 A^L 

U. \ H /o 


riULtrll I-LUU 11 Ig ^cIlCD 




^D. J'H- /o 


r at: U U Ugt: 1 1 CD 


/ o 


1 .H-O /o 


vjfcrllCD III UdldlUU LIUdLcIj 


119 
1 1 z 


9 1 ^^0/. 
Z . 1 O /o 


Genes assigned to COGs 


DfOUD 


7/1 n/io/ 

/4.U4 /o 


Genes assigned in Pfam domains 


4,183 


81.39% 


Genes with signal peptides 


676 


13.15% 


Genes with transmembrane helices 


1,228 


23.89% 


CRISPR repeats 


1 


% of total a 



a) The total is based on either the size of the genome in base pairs or 
the total number of protein coding genes in the annotated genome. 



Table 4. Number of genes associated with the 25 general COG functional categories 
Code Value % age Description 



J 


201 


4.27 


Translation, ribosomal structure and biogenesis 


A 


1 


0.02 


RNA processing and modification 


K 


480 


10.20 


Transcription 


L 


161 


3.42 


Replication, recombination and repair 


B 


1 


0.02 


Chromatin structure and dynamics 


D 


37 


0.79 


Cell division and chromosome partitioning 


Y 


0 


0.00 


Nuclear structure 


V 


64 


1.36 


Defense mechanisms 


T 


187 


3.97 


Signal transduction mechanisms 


M 


265 


5.63 


Cell envelope biogenesis, outer membrane 


N 


94 


2.00 


Cell motility and secretion 


Z 


0 


0.00 


Cytoskeleton 


W 


0 


0.00 


Extracellular structure 


u 


116 


2.47 


Intracellular trafficking and secretion 


o 


153 


3.25 


Posttranslational modification, protein turnover, chaperones 


c 


272 


5.78 


Energy production and conversion 


G 


424 


9.01 


Carbohydrate transport and metabolism 


E 


470 


9.99 


Amino acid transport and metabolism 


F 


106 


2.25 


Nucleotide transport and metabolism 


H 


185 


3.93 


Coenzyme metabolism 


1 


135 


2.87 


Lipid metabolism 


P 


285 


6.06 


Inorganic ion transport and metabolism 


Q 


133 


2.83 


Secondary metabolite biosynthesis, transport and catabolism 


R 


537 


11.41 


General function prediction only 


S 


398 


8.46 


Function unknown 




918 


17.86 


Not in COG 
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